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Abstract 

We consider a join-the-shortest-queue model which is as follows. There are K single FIFO 
servers and M arrival processes. The customers from a given arrival process can be served only 
by servers from a certain subset of all servers. The actual destination is the server with the 
smallest weighted queue length. The arrival processes are assumed to obey a large deviation 
principle while the service is exponential. A large deviation principle is established for the 
queue-length process. The action functional is expressed in terms of solutions to mathematical 
programming problems. The large deviation limit point is identified as a weak solution to a 
system of idempotent equations. Uniqueness of the weak solution is proved by establishing 
trajectorial uniqueness. 



1 Introduction and summary 

Motivation Queueing models with the join-the-shortest-queue (abbreviated further as JSQ) 
mechanism for routing arriving customers are of interest in various application areas, see, e.g., 
Fleming and Simon ^1], Turner [HTj. In a generic set-up, one considers a queueing system con- 
sisting of service stations arranged in parallel. The customers arrive exogenously and join the 
station that has the least number of customers to serve. If there are several stations that have 
fewest customers, then some rule is applied to direct an arriving customer to one of these stations. 
The model admits many versions: along with the stream of the "discretionary" customers that 
choose the station with the least number of customers there may be "dedicated" customers who 
can only be served at specific stations, there may be several streams of discretionary customers 
each of which can join only the stations from a certain subset of the set of all stations, the number 
of servers at the service stations can vary from one to infinity, the stations may be assigned weights 
so that the discretionary customers join the station with the least weighted number of customers, 
upon service completion the customers may either leave the system or be routed back, and so on. 
However, all these versions share the common feature that the arrival rates at the stations depend 
discontinuously on the numbers of customers present. More precisely, the rate at which a given 
station receives customers experiences a jump when the set of the stations with fewest customers 
changes. 
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Thus, JSQ models fall in the category of stochastic systems with discontinuous dynamics (or 
discontinuous stochastic dynamical systems). The dynamical properties of such a system change 
abruptly when it enters a certain domain of the state space. In this paper we concern ourselves 
with a study of the large deviation principle (LDP) for the queue lengths in a JSQ model. Large 
deviations of discontinuous stochastic dynamical systems have received considerable attention in the 
literature, see Alanyali and Hajek [2], Atar and Dupuis [3], Blinovski and Dobrushin H, Borovkov 
and Mogulskii [H], Boue, Dupuis and Ellis Dupuis and Ellis [HinSlE], Dupuis, Ellis and Weiss 
[T2] . Ignatiouk ^j^], Korostelev and Leonov [2U1[^, Majewski (23], Ramanan and Dupuis [2*5] . 
Shwartz and Weiss [SOI- However, available results on the LDP for JSQ are confined to the setting 
of two stations, see Ridder and Shwartz and references therein. Even for that case we have not 
been able to find in the literature an explicit formulation of an LDP for the queue length process 
with an indication of the associated action functional. Paths to overflow have been studied in Foley 
and McDonald JH]) Ridder and Shwartz ,29 , and Turner [3^. A version of the model where the 
stations are represented by infinite servers has been analysed by Alanyali and Hajek who for the 
case of two stations establish an LDP for the queue length trajectories and study overflow paths, 
and Turner |31| who contrasts the results for the single-server and infinite-server cases. 

Model We consider a JSQ model with an arbitrary number of arrival processes and an arbitrary 
number of service stations which are represented by single servers with the FIFO service discipline. 
The customers from a given arrival process can be served only by the servers from a certain subset of 
all servers. The actual destination is the server with the smallest weighted queue length. The model 
thus incorporates both dedicated and discretionary flows of customers. The arrival processes are 
general and are only assumed to obey an LDP. The service times are associated with the servers 
and are exponentially distributied. (In fact, our main result concerns a more general setting of 
autonomous service.) Upon service completion, the customers depart from the system. 

Methods We adopt the strategy that has proved to be useful for establishing LDPs for contin- 
uous dynamical systems. It is based on the characterisation of large deviation relatively compact 
sequences as exponentially tight ones, see, e.g., Puhalskii [2U, EH! The large deviation limit point 
is identified as a weak solution to idempotent equations. The latter equations are obtained as large 
deviation limits of the stochastic equations governing the original dynamical system. 

However, this general method has to be modified for stochastic systems with discontinuous 
dynamics, in particular, for the JSQ model we are concerned with here. Building on the approach 
of Puhalskii [22], we first replace the original discontinuous equations with certain continuous ones, 
to which the limit procedure is applied. We prove trajectorial uniqueness for the resulting system 
of idempotent equations. This implies uniqueness of the large deviation limit point, hence, an LDP. 
The trajectorial uniqueness is proved by showing the existence of a Lyapunov function. 

Contribution We establish an LDP for the queue length process considered as a random element 
of the associated Skorohod space. The action functional is of integral form and is expressed in terms 
of solutions to mathematical programming problems. We also provide some insight into the relation 
between weak and trajectorial uniqueness for idempotent equations which is instrumental in the 
proof. Besides, uniqueness of solutions for a fluid version of the JSQ model is established. 

In broader perspective, the approaches of this paper and those of Puhalskii [27] provide new 
tools for the study of large deviations of stochastic systems with discontinuous dynamics. The 
system of idempotent equations that needs to have a weakly unique solution in order for an LDP 
to hold has the form of a system of differential equations. The latter system can be viewed as 
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describing a fluid version of the original queueing model where the arrival, service and queue length 
processes are represented by absolutely continuous functions. One of the implications of the results 
of this paper is that if, given trajectories of exogenous arrival and service processes ("the inputs"), 
the system of differential equations has a unique solution for the queue length trajectory ("the 
output") then weak uniqueness for the associated idempotent equations holds, so an LDP holds. 
In short, trajectorial uniqueness for the fluid model implies an LDP. In fact, one can allow the fluid 
model to have a unique solution for a certain subset of the set of all "inputs" , cf . , Puhalskii [27] . 
As the application in this paper shows, the method does not restrict the number of domains of 
constant dynamics sharing a common boundary, while such constraints are inherent in the existing 
techniques. 

Organisation The paper is organised as follows. The main result is stated in Section^ Section[3] 
is concerned with its proof. The appendix reviews basics of idempotent probability and large 
deviation convergence. 

Notation, terminology, conventions We will say that a function I from a metric space E to 
[0, oo] is an action functional if it is lower compact that is the sets {z£E: I(z) < a} are compact for 
a £ R+ and inf zG E I(z) = 0. A sequence {P ra , n £ N} of probability measures on the Borel <r-algebra 
of E (or a sequence of random elements {X n , n £ N} with values in E and distributions P n ) is said to 
obey the LDP for scale n with the action functional I if limsup,^^ n _1 log P n (F) < — inf^g^ I(z) 
for each closed subset F of E and liminfy^oo n~ l logP n (G) > — inf 26 c I(z) for each open subset 
G of E. 

The set of natural numbers is denoted as N, the set of real numbers is denoted as R and the non- 
negative hairline is denoted as R + ; for £ £ N, M. e and R+ denote the cartesian products of I copies of 
R and R+, respectively, with product topology. Elements of R^ are considered as column- vectors, 
lg denotes the element of R^ with all the entries equal to one, superscript T is used to denote the 
transpose of a matrix. Inequalities involving vectors are understood componentwise. We denote 
as B(R_|_,R^) the Skorohod space of R^-valued right-continuous with left-hand limits functions on 
R+. It is assumed to be endowed with the Skorohod Ji-topology and metrised by a complete 
separable metric, see Ethier and Kurtz JSj) Liptser and Shiryaev |22j . Jacod and Shiryaev JH] for 
the definition and properties. The elements of D(R + ,R^) are denoted with lower-case bold-face 
Roman characters, e.g., x = (x(i), t £ R+). 

The following conventions are assumed: sums and infima over the empty set are equal to and 
oo, respectively, 0/0 = 0, and • (—00) = 0- oo = oo-0 = 0. For a £ R and j3 £ R, we denote 
as [a\ the integer part, a A = min(a,/3), a V f3 = max(a,/3), and a + = a V 0; l(r) denotes the 
indicator function of a set V that is equal to 1 on V and is equal to outside of V. We use dot to 
denote differentiation with respect to the time variable, so that x(t) denotes the time-derivative of 
an absolutely continuous function (x(i), t £ R+); if x(t) is a vector, then the notation signifies that 
each component is differentiated; x(i— ) denotes the left-hand limit at t; "a.e." refers to Lebesgue 
measure unless specified otherwise. All the relations involving derivatives are understood to hold 
a.e. 

2 The LDP for join the shortest queue 

Dynamics We provide a more specific model description and turn it into equations relating 
stochastic processes. There are K queues, indexed 1 through K, each with a single server. The 
servers are fed by M arrival processes indexed 1 through M. Customers from the m-th arrival 
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process can be served only by the servers from a nonempty subset S m of all servers. More specifically, 
an arriving customer is routed to the queue with the smallest weighted queue length: queue h G S m 
is assigned weight Wkm, where Wkm > 0, and the customer in question joins the queue with the 
least value of Xk/wk m , k = 1, 2, . . . , K, where Xk stands for the number of customers in queue k 
at the time when the arrival occurs. In case there are two or more queues with the least value of 
x k/wkm, the arrival is routed to one of the competing queues. The actual rule used for breaking 
ties is of no consequence for the results below. Note that in this set-up dedicated arrivals are those 
with one-element sets S m . 

We assume that the servers perform service autonomously in the following sense. Each server is 
assigned a point process (i.e., a piecewise constant nondecreasing process starting at zero with unit 
jumps) which specifies service completion times: if at the moment preceding a jump time there is 
a customer present in the queue, then this customer leaves the queue at the time of the jump. Due 
to the memoryless property of the exponential distribution, servers with exponential service times 
can be considered as servers with autonomous service. 

Our set-up concerns, in fact, a sequence of queueing models as described above indexed by 
n G N. To distinguish between the models, superscript n will be used in the notation for the 
entities associated with the n-th model. Let = (A m (t), t G R+) and = {B%(t), t G 
where m = 1, 2, . . . , M and k = 1, . . . , K, be one-dimensional point processes. It is assumed that 
^m(O) = ^fc(O) = 0- We let A m (t) model the cumulative number of exogenous arrivals in the 
m-th arrival process by time t and let B m {t) model the number of service completions by server m 
during time t of uninterrupted work of the server. All the processes are defined on a probability 
space (il,^ 7 , P) and have trajectories in the associated Skorohod spaces. Let Ck denote the set 
of m G {1,2,..., M} such that k G S m and Q^(t) denote the number of customers in queue k at 
time t. The process Q n = (Q n (t), t G where Q n (t) = (Q%(t), k= 1,2, . . . ,K), satisfies the 
following balance equations, where k = 1, 2, . . . , K and t G M+, 



t 

QUt) = QUO) + E [/ l(^=* < mm dAUs) 

k/ V w km «es m : Wl m ) 

m£C k g l-Lk 
t t 

+ [ 1 f9Mfzl =m i n 9l^zh a n {s)dA n {s) \_ 1 (Q»(»_)>0)dBy(.). (2.1) 
J V Wkm Ze5 m : Wlm > J J 

l^k 

In this equation, the random variables «/? m (s) account for the rule adopted for breaking ties between 
several queues of minimum weighted length, so one of the random variables ct^ m (s), k = 1, 2, . . . , K, 
equals 1 while the rest equal 0. The integrals on the right of (|2.1|) are well defined by being finite 
sums. Since the A m and B^ have piecewise constant trajectories with finite numbers of jumps on 
bounded intervals, (|2.1|) admits a unique solution for Q^(t), see Chen and Mandelbaum [7] for more 
extensive results. 



Statement of the LDP Let us be given a [0, oo]-valued Borel function ifj(z), where z G 
Let function I A > B : D(R+, R M+K ) -> R + be defined by the equality 

oo 

I A > B (z) = J V(z(t)) dt, (2.2) 



if the function z = (z(i), t G R+) is absolutely continuous, componentwise nondecreasing, and 
z(0) = 0, and I A ' B (z) = oo otherwise. It is assumed that 1 A ' B is an action functional on 
D(R_i_, M. M+K ). It follows that rp is an action functional itself. 
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For x = {x\, . . . , xk) G R:jf and y = (yi, • • • , Vk) G R let N(x,y) denote the set of a = 
(ai, . . . , a/vf) G R+' and 6 = (61, . . . , &rt) G R+ for which there exist matrices e = (e km ) G R+ xM 
and vectors d = (d\, . . . ,dj{) G R^ such that y = eljy/ — d, & T 1k < a, d < b, e km = if either 
A; or x k /w km > min/ 6Sm xi/wi m , and = a fc if x k > 0. Let 

L(*,y) = inf Ma,b). (2.3) 

(o,6)eiV(a:,j/) 

Since ip is lower compact and N(x, y) is closed, the infimum above is attained provided N{x, y) is 
nonempty. Besides, an easy compactness argument shows that the function L is lower semicontin- 
uous. In particular, it is Borel measurable. 
For q G R K and q G B(R + ,R X ), we define 



Ig(q) = | L(q(t),q(t))di 





if go £ R+ j the function q is componentwise nonnegative, absolutely continuous and q(0) = qo, and 
Igo(q) = 00 otherwise. 

Let Ql(t) = Ql(nt)/n, T m {t) = A^(nt)/n, and B%(t) = B%(nt)/n, where m = 1,2,...,M 
and = 1,2, ... ,K. We introduce the processes Q k = (Q k (t), t G R+), A m = (vl m (i), i G R+), 
= (KW.i G R+), = (Q^, k = 1,2,..., K), ~A n = {Xn,™ = 1,2,... ,M), andF = 
k = l,2,...,K). 

Theorem 2.1. Let, as n — > 00, i/te sequence {(Q n (0),A n ,l3 n ),n G N} o&ey £/ie LUP in 
R K x B(R + ,R M ) x D(R + ,R^) for scale n with the action functional l Q °' A ' B (q , a, b) = I Qo (g ) + 
I j4 ' S (a, b), where 1®° zs an action functional on M. K . Then the sequence {Q , n G N} obeys 
the LDP for scale n in B(R + ,R^) with the action functional 1® defined by the equality I^(q) = 
I*'(q(0))+lJ 0) (q). 

Comments and corollaries The fact that 1^ is an action functional under the hypotheses is 
a part of the assertion of the theorem. One can establish this property without relying on the 
proof of Theorem 12.11 if in addition to being an action functional the function ip is assumed to 
be convex and of superlinear growth at infinity. The latter conditions also ensure that \ A ' B is an 
action functional. The argument is similar to the one in Puhalskii |27j . 

Expression (|2..3|) for the local action functional is intuitive. If we interpret a as the vector of 
instantaneous exogenous arrival rates, b, as the vector of service rates, d, as the vector of customer 
departure rates, and e, as the matrix of rates at which customers from different exogenous arrival 
processes arrive at the servers, then (j2.3|) tells us that "the cost" for the queue- length vector 
to change at rate y given it equals x is obtained by minimising the local action functional for 
the exogenous arrivals and service subject to certain "conservation laws": the rate of the queue 
length change must equal the difference between the arrival and departure rates at the servers, the 
departure rates cannot exceed the service rates, the sum over all the servers of the arrival rates due 
to the customers orginating from a specific exogenous arrival process must equal the arrival rate of 
this process. Besides, the ratio d k /b k , provided b k > 0, can be interpreted as the fraction of time 
that server k is busy serving customers, while e km /a m can be interpreted as the fraction of stream 
m customers directed to server k. 

The definition of the function L(x,y) implies that it is piecewise constant in x in that 

L(x,y) =^l(xeF w )$jj(i/). (2.4) 
1, J 
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The summation is over all subsets I of {1, 2, . . . , K} (including the empty set) and all sets J of the 
form J = J\ x . . . x Jm, where the J m , for m = 1, 2, . . . , M, are nonempty subsets of the S m such 
that either J m C I or J m n I = 0, denotes the subset of R+ of elements x = (xi, . . . , xr-) such 
that Xk = if k G /, x& > if k £ /, Xk/wkm = mm zeS m %l/i*>lm for m = 1, 2, . . . , M if k G J m , and 
Xk/wkm > m i n ;es m xi/wi m for m = 1, 2, . . . , M if A; J m . In words, if x G F/j, then set I indexes 
the entries of x that are equal to zero and J m indexes the entries that are the smallest weighted 
fluid queue lengths for the m-th fluid arrival process. The functions ^jj are uniquely specified by 
(|2.4I) . It is easy to see that these functions are action functionals on R-^. If, moreover, the function 
ip is convex, then the functions ^fjj are also convex. Representation (|2.4I) shows that the Fjj are 
the domains of constant queue length dynamics. 

The functions ^jj can be written down more explicitly if more structure is imposed on the 
function ip. If we assume that ip(a,b) = tp A (a) + Ylk=l (Pk), where the functions ip^ : R+ — > 
[0, oo] are lower semicontinuous, convex, attain zero, and are not equal to zero identically, then 
minimisation with respect to b in the definition of ^ U yields the representation 

*/j(y)= inf (V, A (a)+Vvf(4) + V ?/f (4)1(4 >Mfe)) ; 

where = sup{6 fe G R+ : tp^(b k ) = 0} and #j(a, d), for a G M+ f and d G R+ , is the set of y G M x 
for which there exist matrices e = (efc m ) G R^ xM such that y = &1m — d, & t \k < a, and efc m = 
if k £jL J m . 

As a consequence of this observation and Theorem 12.11 we obtain the following result for the 
Markovian setting. For a G R+, we denote 7r(a) = a log a — a + 1. 

Theorem 2.2. Let the and B^ be independent Poisson processes with respective rates A^ 
and which are also independent of Q n (0). Let, as n — > oo, A^ — ► A m , /zj? — » /x^, and i/ie 
sequence {Q (0), n G N} o&ey i/ie LDP /or sca/e n in M. K with an action functional Then 
the sequence {Q , n G N} o&eys i/te LDP /or scale n in D(ILj.,R ) imiTi f/ie action functional 
I^(q) = I ( 3°(q(0)) + I^ ^(q). TTie function L(x,y) is of the form (|2.4|) . w/iere 



At 



M^J^)^ v A m ' ^ V ^ W V* 

For the proof of Theorem 12.21 note that by Theorem 2.3 in Puhalskii [23] the LDP 
for {(Q (0),A ,B ), n G N} in the hypotheses of Theorem 12.11 holds with ip(a,b) = 

J2m=l 7r ( a m/A m )A m + Ylk=l ^{bk/Hk)^- 



3 Proof of Theorem EH] 

The proof uses the terminology of large deviation convergence (abbreviated as LD convergence) and 
idempotent probability, which is recapitulated in the appendix, for more detail see Puhalskii |26j . 
We start with a proof outline, where we also define some concepts which illuminate the connection 
of LD convergence and weak convergence. 



Weak uniqueness and trajectorial uniqueness Let X n = (Q™(0), A n , B n ) and Y n = Q n . 

These random variables assume values in the respective metric spaces E = R^ x D(R + ,R A/ ) x 
D(R + , R K ) and E y = B(R + ,R K ). The sequence X n LD converges in distribution at rate n by 
hypotheses. We seek to prove that the Y n LD converge in distribution and find the limit. 
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Equation (|2.1j) is transformed into an equation Ft(X n ,Y n ) = 0, t £ R+, where the functions 
Ft are discontinuous. This equation enables us to establish that the sequence Y n is exponentially 
tight. Thus, there exists a subsequence (X n , Y n ) that LD converges in distribution to a certain 
idempotent process (X, Y). By hypotheses, the idempotent process X has idempotent distribution 
n^-(x) = exp(— lQ°' A ' B (it)), where x £ E^. In order to identify the idempotent distribution of Y 
we would like to relate X and Y by equations obtained as certain LD limits. Since the original 
equation F t (X n ,Y n ) = 0, t £ M + involves discontinuities, as a preliminary step we replace it with an 
equation Ft(X n , Y n , Z n ) = 0, t £ M+, where the Ft are continuous functions and Z n are additional 
random variables, which we call "latent". They assume values in a metric space E z . The sequence 
(X n ,Y n ,Z n ) is still provably exponentially tight. Taking an LD limit along a subsequence, we 
have that if idempotent variables (X,Y,Z) defined on an idempotent probability space (T, II) 
constitute an LD accumulation point of the (X n ,Y n , Z n ) for LD convergence in distribution, then 
F t (X,Y,Z) = II-a.e. (Note that one can always take E x x E Y x E z as T.) The idempotent 
distribution of (X, Y, Z) is proved to be concentrated on a set E C E x x E Y x E z in the sense 
that U((X, Y, Z) £ E x x E Y x E z \ E ) = 0. We refer to the idempotent distribution of Y, which 
is defined by U Y (y) = II (Y = y), as a weak solution of the equation Ft(X, Y, Z) = 0. If this weak 
solution is unique, then the idempotent law of Y is specified uniquely, so the Y n LD converge in 
distribution to F. As in the theory of stochastic differential equations, weak uniqueness follows 
from trajectorial uniqueness, which is defined as follows. We say that (X, Y)-trajectorial uniqueness 
on Eo holds for the equation F t (X, Y, Z) = if equalities F t (x, y, z) = and i^(x, y',z') = for 
(x, y,z) £ Eo and (x, y',z') £ Eo imply that y = y'. In the next lemma, given x, we let G(x) 
denote the set of y such that Ff(x., y, z) = 0, t £ R + , for some z, where (x, y,z) £ Eo- Note that 
G(x) = if x does not belong to the projection of Eo on E x . 

Lemma 3.1. If (X,Y) -trajectorial uniqueness on Eo holds for the equation Ft(X,Y, Z) = 0, then 
n(y = y) = sup x6G -i( y ) H(X = x). In particular, H(Y = y) is specified uniquely. 

Proof. Let U be the subset of T such that F t (X(v), Y (v), Z(v)) = and (X(v),Y(v), Z(v)) £ E 
for v £ U. By hypotheses, II(T \ V) = 0, so U(X = x,Y = y) = U({X = x,Y = y} n U). 
If v £ U and Tl(v) > 0, then the set G(X(v)) contains one element. Hence, either Yl(X = 
x, Y = y) = II(X = x) if y £ G(x) or U(X = x, Y = y) = otherwise. Consequently, 
n(Y = y) = sup xeE x n(X = x, Y = y) = sup xeG -i (y) U(X = x) = sup x6G -i (y) II x (x). □ 

Thus, the proof of the theorem is completed by establishing trajectorial uniqueness. In what 
follows, we implement this programme. 



Stochastic equations By (|2.1|) 



Qk(t) = Qfc(o) + [ 

m€Cfc o 

-Q n k(s-) 



Qk( s ~) . . Qi (g- 

< mm 

^km l^Sm- U)l m 



dA^(s) 



+ 



Wkr, 



mm 



Qi (s- 

Wlm 



^km(ns) dT m {s) 



1(55' 



( S -)>0)dB£( S ). (3.1) 
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We introduce the following latent variables 

t 

Dl(t) = J l{Ql{s-) > 0) dSl(a), (3.2) 
o 

E km (t) = / 1 < mm )dA m (s) 

J \ Wfcm leS m : Wi m J 



o ^ 



+ / lPf^= min HUHd^W. (3.3) 

tyk 

They enable us to replace (|3.4a|) with a system of "continuous" equations. By (|3.1j) . 1)3.2(1 . and 
ipOjl for i G R + and k = 1, 2, . . . , K 

Ql{t) = q» + E ^(*) - ^ ( 3 - 4a ) 

t t 
J Q^s-) dDl{s) = J Q^s-) dBl(a), (3.4b) 





mm2L^L\dE' km {8) = 0, (3.4c) 

= E ^rn(t). (3.4d) 

fees 177 
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Let D k = (D k (t),t G R+) and E km = (~E km (t), t G R + ), where fc = 1,2,..., if and 
1, 2, . . . , M, W = (pi, k = 1, 2, . . . , K ) and = (E£ m , fc = 1, 2, . . . ,K, m = 1, 2, . . . , M) . The 
latent variables Z n are defined by Z n = (E n , D™) and assume values in E z = D(R + , M. KxM ) x 
B(R+, M. K ). Below, generic elements of spaces K x , K Y , and K z are denoted as follows: for space E , 
as x = (g ,a,b), where q = (g ,i, • • • ,Qo,k) G R X , a = (a m , m = 1, 2, . . . , M) G B(R + ,R M ), and 
b = (b fc , k = 1,2,..., K) G 0(R+, R K ), for space E y , as y = q, where q = (q fc , k = 1, 2, . . . , K ) G 
B(R+,R X ), and for space E z , as z = (e,d), where e = (e fc , k = 1,2,..., K) G B(R + ,R X ) and 
d = (d fe , k = 1, 2, . . . , K) G Tn,nn> ■ 



Exponential tightness As pointed out above, the hypotheses of the theorem imply that the 
X n LD converge in distribution at rate n to an idempotent variable X = (Qq,A,B) with idem- 
potent distribution 11^. Since A and B have continuous paths II-a.e., the sequence (A n ,B n ) is 
C-exponentially tight. Let us show that the sequence {A n , B n ,Y n , Z n ) is C-exponentially tight. 
We repeat the argument of the proof of Lemma 4.1 in Puhalskii [23|. By (|3.1|) . 1)3. 2|) . and (|3.3j) the 
increments of the Q k (t), E km {t) and D k {t) are majorised as follows by the increments of A m (t) 
and B k (t): for s < i 

m(t)-Qi(s)\< e ffiw-iwi + Kw-^)!, 

|^(t)-^(,)|<|^(t)-^( S )|, (3.5) 
-^L(«)l < PC(*) -^CWI- (3-6) 
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Since the sequences {A^, n G N} and {B^, n G N} are C-exponentially tight and the sequence 
{Q (0), n £ N} is exponentially tight, the above inequalities imply by ()A.1|) that the sequences 
{Q k , n £ N}, {E km , n G N} and {D k , n G N} are C-exponentially tight. Therefore, the sequences 
Y n and Z n are C-exponentially tight, so the sequence {(A n , B n ,Y n , Z n ), n G N} is C-exponentially 
tight. As a consequence, the sequence {(X n ,Y n , Z n ), n G N} is exponentially tight. 



Large deviation limit Let the (X n , Y n , Z n ) LD converge along a subsequence to an idempotent 
process (X, Y, Z) defined on an idempotent probability space (T, II), where X = (Qq, A, B), Y = Q, 
and Z = (E,D). Our ultimate goal is to show that 11^ (q) = Tl(Q = q) is specified uniquely. Note 
that n-a.e. A(0) = G R A/ , B(0) = D(0) = G R K , and E(0) = G M^ xM . Let us show that 
II-a.e. the idempotent processes D k are absolutely continuous, nonnegative and nondecreasing. 
Since P(25£(t) -D%(s) < 0) = for t > s and liminfn^oo P(5£(i) - 5£(s) < 0) 1 /™ > II(£> fc (t) - 
D k (s) < 0), we obtain that II(Dfc is not nondecreasing) = sup s<t II(Dfc(t) — D k (s) < 0) = 0, 
so D k is nondecreasing II-a.e. It is therefore nonnegative. Next, by (|3.5|) P(\D k (t) — D k (s)\ > 
\B n k {t) -B n k {s)\) =0, so similarly to the above U(\D k (t) - D k (s)\ > \B k (t) - B k (s)\) = 0. Since the 
function B k is absolutely continuous II-a.e., it follows that so is D k and D k (t) < B k (t). A similar 
argument applied to the E km and using (|3.6|) shows that the E km are II-a.e. absolutely continuous 
and nondecreasing. 

Letting n — > oo in (|3.4aj) . (|3.4b|) . (|3.4cj) and l)3.4d|) we obtain by the continuous mapping 
principle that II-a.e. 

Qk(t) = Qo,k + ^2 E km (t) - D k (t), 

t t 
Q k (s)dD k (s) = [ Q k (s)dB k (s), 





t 

Qk(s) . Qi{s) . , 

mm ) dE km (s) = 0, 







An(£) = E km (t). 



Trajectorial uniqueness We take as E the subset of M A ' x B(M + , M 3X+M+/0/+1 ) of elements 
((/o, q, a, b, e, d) such that q(0) = go? the functions q, a, b, d and e are componentwise nonnegative 
and absolutely continuous, the functions a, b, d and e are nondecreasing and start at 0, and the 
following relations hold for k = 1, 2, . . . , K, m = 1, 2, . . . , M, t G M+ 

qfc(i) = ^2 ^ m {t) -dfe(i), q fe (0) = g 0) fc, (3.8a) 

qfc(*)(d fe (t)-b fc (t)) =0, (3.8b) 
mm e fem (i) = 0, (3.8c) 

V Wkm l£S m Wlm J 

M*) = X] dfc ^) ^ k fc (t). (3.8d) 

fees /"/I 

We have proved that H((Q , A, B,Q, E, D) g" E ) = 0. Hence, H(Q = q) = if q either is not 
absolutely continuous, or componentwise nonnegative, or it is not obtained as a solution of the 
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latter system of equations. We next prove that this system of equations uniquely specifies q given 
qo, a and b. 

Let q' be another solution, i.e., 

= E ®fcm(') ~ **fc(*)j 4(0) = 9b,fc> 
q*(«)(j*(0-b*(t)) =0, 

(3M m i, l3 M), Uf) = , 

V U; fcm leS m Wlm J 

k m (t) = £ e'kmit), <(t) < b fc (t). 
fees 

It suffices to prove that 

d K 

-^|q fe (i)-qUi)l<0. (3.9) 

fc=l 

We have 

, K K 

Jt £|q*(t) - «!*(*) I = E(M') " q*(*))i(q*W > q*(*)) 
fe=i fc=i 

+ (q' fc (t) - q*(*))l(q*(*) < q*W)) • (3-10) 

By (l3~3aT) 

- q*(t) = E (^(*) " «fcm(*)) + (<*'*(*) " d*(t)) • (3.11) 

If qjfe(£) > Qfc(i)) then qfc(t) > 0, so by ()3.8b|) dfc(i) = bfc(i), which implies by the inequality 
dfc(i) < b fe (t) and (l3~TB that q fc (i) - q' fe (t) < £ meCfc (<W*) - e^W) a.e on the set {q fe (i) > 
q ^)}. S imilarly, q' fc (t) - q fe (t) < X) meC 7 fe (e' fcm (t) ~ ^kmif)) a.e on the set {q' fc (t) > q fc (i)}. Hence, 

by grmji 

, if M 

| £|q*(t) " «!*(*) I <EE ((WO " e' fcm (t))l(q fc (t) > «&(*)) 

fc=l m=l fee5 m 

+ (e' fe m(*) - e fem (t))l(qfc(0 < q£(t))) . 

We prove that on the right-hand side each sum over S m is nonpositive. 

If mmi eSm qi(t)/wi m = minjeS TO q^(i)/u;j m , then o n the set where q fc (t) > q' fc (i) we have that 
Hk(t)/wkm > q.' k (t)/w km > tnini eSm <ll(t)/wi m , so by (|3.8c|) e fem (i) = 0. Similarly, e' fcm (t) = a.e. 
on the set where q^(t) > qfe(i). The required property follows. 

If mmi eSm <ii(t)/wi m < mmi e s m ^{t)/wi m , then, analogously to the preceding argument, 
6fcm(£) = a - e - on Lne seL where qfe(f) > ql(i)) and the sum in question is not greater than 
EfceS m (^ m (*)-efcm(*)) = a m (t)-a m (t) = 0. Similarly, if raiaj e s m qi(t)/wi m > mim eSm ^(t)/wi m , 
then e^, m (i) = on the set where qfc(i) < q^(i) so that this sum is not greater than 
^/ceS m (^mW ~~ ^fcm(O) = 0- Inequality (|3.9|) has been proved. 
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Evaluating the limit idempotent distribution By Lemma 13. II 

n(Q = q)= sup U(Q = q ,A = a,B = h), (3.12) 

(<70,a,b)GA(q) 

where A(q) is the set of (qo,&, b) such that (|3.8a|) . (|3.8b|) . Q3.8c|) and ()3.8d|) are satisfied. 

We conclude the proof by evaluating the right-hand side of (|3.12|) . Let I(q) = — log II (Q = q). 
Then 

oo 

I(q) = inf (l Qo (q ) + r 4 ' B (a,b)) = I Qo (q(0)) + inf / ^(a(*),b(t))rft, (3.13) 

( 50 ,a,b)eA(q) (a,b)eA(q) J 



where A(q) is the set of (a, b) such that (|3.8a[) . (|3.8b|) . (|3.8c() and (|3.8d|) are satisfied with q(0) 
as qg. The definition of L(x,y) implies that ip(a(t), b(t)) > L(q(i),q(i)) a.e. for (a, b) G A(q). 
Hence, I(q) > I Q (q). 

In order to prove the reverse inequality, we assume that I Q (q) < oo, so L(q(t),q(t)) 
a.e. Since the infimum in the definition Q2.3JI of L(x,y) is attained if finite, the sets T(t),t G R+, 
consisting of (a, b) G JV(q(i), q(t)) such that ip(a,b) = L(q(i), q(i)) are nonempty for almost all t. 
Also, the graph {(i, (a, 6)) € R+ x ~R M+K : (a, 6) G r(t)} is measurable with respect to the product 
of the Lebesgue cr-algebra on R + and Borel cr-algebra on W M+K . Therefore, by a measurable 
selection theorem (see, e.g., Clark jSj) there exist Lebesgue measurable functions a and b such that 
(a(t), b(i)) G T(t) a.e. Letting a(i) = J * a(s) (is and b(t) = J ' b(s) ds, we obtain that (a, b) G A(q) 
and / °° ip(&(t), h(t)) dt = L(q(t), q(t)) dt. Thus the infimum on the right of (j3,13|) is equal to 
/ °°L(q(t),q(t))dt, hence, I(q) = I Q (q) and n«(q) = exp(-I ( 3(q)) . 

Acknowledgements The first author is thankful to Yuri Suhov for drawing his attention to the 
join-the-shortest-queue setting. 

A Review of idempotent probability and large deviation conver- 
gence 

Let T be a set. A function II from the power set of T to [0, 1] is called an idempotent probability if 
II(r) = sup wgr II({u}), rcT and n(Y) = 1. The pair (T, II) is called an idempotent probability 
space. For economy of notation, we denote H(v) = II({i;}). A property V(v), v G T, pertaining 
to the elements of T is said to hold II-a.e. if U(V(v) does not hold) = 0. A function / from a 
set T equipped with idempotent probability II to a set T' is called an idempotent variable. The 
idempotent distribution of an idempotent variable / is defined as the set function II o / _1 (r) = 
II(/ G r), r c T'. If / is the canonical idempotent variable that is defined by f(v) = v, then 
it has II as the idempotent distribution. T'-valued idempotent variables / and /' are said to be 
independent if n(/ = v', f = v") = U(f = v')Il(f = v") for all v',v" G T'. Independence 
of finite collections of idempotent variables is defined similarly. A collection (Xt, t G R+) of Re- 
valued idempotent variables on T, where i G N, is called an idempotent process. The functions 
(X t (v), t G R+) for various v G T are called trajectories (or paths) of X. Idempotent processes are 
said to be independent if they are independent as idempotent variables with values in the associated 
function space. 

If T is, in addition, a metric space and the sets {v G T : 11(f) > a} are compact for all a G (0, 1], 
then II is called a deviability. Obviously, II is a deviability if and only if 1(f) = — log II({u}) is an 
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action functional. If / is a continuous mapping from T to another metric space T', then II o / _1 is 
a deviability on T'. As a matter of fact, for the latter property to hold, one can only require that 
/ be continuous on the sets {veT: 11(d) > a} for a G (0, 1]. In general, / is said to be a Luzin 
idempotent variable if II o f~ l is a deviability on T'. 

Let {P n , n G N} be a sequence of probability measures on a metric space E endowed with 
Borel (j-algebra and let II be a deviability on E. Let m n — > oo as n — > oo. The sequence 
{P n , n G N} is said to large deviation converge (LD converge) at rate m n to II as n — > oo if 

/ \ l/rrin 

limn^oo I / ,. — sup ze jg f(z)Yl(z) for every bounded continuous R+ -valued func- 

tion / on E. This definition is equivalent to requiring that the inequalities limsup n ^ oc P n (i ? ) 1//m " < 
II(i ? ) and liminfn^oo P n (G) 1//m " > 11(G) hold for all closed sets F and all open sets G, respec- 
tively. Therefore, the sequence {P n , n G N} LD converges at rate m n to II if and only if it obeys 
the LDP for scale m n with action functional I(z) = — log n(z). We favour the term "LD conver- 
gence" over "the LDP" as being more natural for our approach. The deviability II is said to be 
an LD limit point of the P n for rate n if each subsequence {P nji , k € N} of {P n , n € N} contains 
a further subsequence {Pn fc; , I £ N} that LD converges to II at rate as I — > oo. The sequence 
{P n , n G N} is said to be exponentially tight on order m n if for arbitrary e > there exists a 
compact subset E of E such that limsup n ^ oc P n (E \ E) l / mn < e. An exponentially tight sequence 
possesses LD limit points. Thus, one can prove LD convergence of the P n by proving that expo- 
nential tightness holds and that there is a unique LD limit point. We will say that a sequence of 
random variables with values in a metric space is exponentially tight if so is the sequence of their 
laws. 

LD convergence of probability measures can be also expressed as LD convergence in distribution 
of the associated random variables to idempotent variables. In the setting of stochastic processes, 
this point of view enables one to consider the LD limit as a dynamical system rather than as 
"a mass function" on the space of trajectories. We say that a sequence {X n , n G N} of random 
variables defined on probability spaces (Q n ,J- n ,P n ) and assuming values in E LD converges in 
distribution at rate m n as n — > oo to a Luzin idempotent variable X defined on an idempotent 
probability space (T, II) and assuming values in E if the sequence of the probability laws of the 
X n LD converges to the idempotent distribution of X at rate m n . Conversely, LD convergence 
of a sequence {P n , n G N} of probability measures on E to a deviability II on E can expressed 
as LD convergence in distribution if one considers the canonical setting. The continuous mapping 
principle, known as the contraction principle for the LDP, states that if the X n LD converge in 
distribution to X and / is a continuous function from E to another metric space, then the f(X n ) 
LD converge in distribution to f(X). We will use the extension, also referred to as the continuous 
mapping principle, where the function / is allowed to be a measurable function that is continuous 
only a.e. with respect to the idempotent distribution of X. For a detailed discussion, see Garcia 
16 . The definition of a limit point for LD convergence in distribution is similar to that for LD 
convergence of probability measures. 

If the X n are stochastic processes with trajectories in a Skorohod space B(R + ,R^), then the 
sequence X n is said to be C-exponentially tight on order m n if it is exponentially tight on order 
m n and each LD limit point II of the distributions of the X n is an idempotent distribution of a 
continuous-path idempotent process in the sense that n(D(R+, R )\C(R+, R )) = 0. The sequence 



12 



X n is C-exponentially tight on order m n if and only if 

lim limsupP(|A n (0)| > L) 1/mn = 0, 
limlimsupP( sup \X n (t) - X n (s)\ > e) 1/mn = 0, T € R+, e > 0. ( A - 1 ) 

(5^0 71-+00 s,t6[0,T]: 
|s-*|<<5 

If the sequence X n is C-exponentially tight, then a limit point of the X n for LD convergence in 
distribution may be considered as an idempotent process with trajectories in C(R + , R^). Converesly, 
if the X n LD converge in distribution to a continuous-path idempotent process, then the sequence 
X n is C-exponentially tight. 

We also note that, as a consequence of the continuous mapping principle, if a sequence of stochas- 
tic processes (X n , Y n ) assuming values in R £l and R £2 , respectively, LD converges in distribution in 
B(R + ,R £l ) x D(R + ,R £2 ) to a continuous-path idempotent process, then the LD convergence also 
holds in D(R + ,R £l x R £a ). 
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